Scatterplot layout for high-dimensional data visualization
نویسندگان
چکیده
Multi-dimensional data visualization is an important research topic that has been receiving increasing attention. Several techniques that apply scatterplot matrices have been proposed to represent multi-dimensional data as a collection of two-dimensional data visualization spaces. Typically, when using the scatterplot-based approach it is easier to understand relations between particular pairs of dimensions, but it often requires too large display spaces to display all possible scatterplots. This paper presents a technique to display meaningful sets of scatterplots generated from high-dimensional datasets. Our technique first evaluates all possible scatterplots generated from high-dimensional datasets, and selects meaningful sets. It then calculates the similarity between arbitrary pairs of the selected scatterplots, and places relevant scatterplots closer together in the display space while they never overlap each other. This design policy makes users easier to visually compare relevant sets of scatterplots. This paper presents algorithms to place the scatterplots by the combination of ideal position calculation and rectangle packing algorithms, and two examples demonstrating the effectiveness of the presented technique.
منابع مشابه
A constraint-based layout approach to data visualization
Fig. 1. A scatterplot with extreme overplotting (left), with random jitter (middle) and the result from our constraint-based layout framework (right). Abstract—This work explores the connection between statistical data visualization problems and simulated physical systems. Applying techniques for constraint-based layout, data points are treated as physical objects under the influence of forces ...
متن کاملQuality Metrics Driven Approach to Visualize Multidimensional Data in Scatterplot Matrix
Extracting meaningful information out of vast amounts of high-dimensional data is challenging. Prior research studies have been trying to solve these problems through either automatic data analysis or interactive visualization approaches. Our grand goal is to derive representative and generalizable quality metrics and to apply these to amplify interesting patterns as well as to mute the uninter...
متن کاملMeasuring Insight into Multi-dimensional Data from a Combination of a Scatterplot Matrix and a HyperSlice Visualization
Understanding multi-dimensional data and in particular multi-dimensional dependencies is hard. Information visualization can help to understand this type of data. Still, the problem of how users gain insights from such visualizations is not well understood. Both the visualizations and the users play a role in understanding the data. In a case study, using both, a scatterplot matrix and a HyperS...
متن کاملQuality-Based Visualization Matrices
Parallel coordinates and scatterplot matrices are widely used to visualize multi-dimensional data sets. But these visualization techniques are insufficient when the number of dimensions grows. To solve this problem, different approaches to preselect the best views or dimensions have been proposed in the last years. However, there are still several shortcomings to these methods. In this paper we...
متن کاملLayout Based Visualization Techniques for Multi Dimensional Data
In this paper we present an overview for the interactive visualization of structural information in tabular multidimensional data. We first provide an overview of methods for layout of high dimensional data. Then we discuss a number of interactive visualization methods that can be used to present the structure of these high dimensional spaces. We discuss the use of the described methods in thre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Visualization
دوره 18 شماره
صفحات -
تاریخ انتشار 2015